Towards Scalable Event Tracing for High End Systems
Abstract
Although event tracing of parallel applications offers highly detailed performance information, tracing on current leading edge systems may lead to unacceptable perturbation of the target program and unmanageably large trace files. High end systems of the near future promise even greater scalability challenges. Development of more scalable approaches requires a detailed understanding of the interactions between current approaches and high end runtime environments. In this paper we present the results of studies that examine several sources of overhead related to tracing: instrumentation, differing trace buffer sizes, periodic buffer flushes to disk, system changes, and increasing numbers of processors in the target application. As expected, the overhead of instrumentation correlates strongly with the number of events; however, our results indicate that the contribution of writing the trace buffer increases with increasing numbers of processors. We include evidence that the total overhead of tracing is sensitive to the underlying file system.
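The buffering and flushing behaviour named in the abstract can be illustrated with a minimal sketch. It is not the tracing library used in the paper: the `TraceBuffer` class, its `capacity` parameter, and the `run` helper are hypothetical names introduced only for illustration, and an in-memory `io.StringIO` stands in for the trace file on disk. The sketch records events into a fixed-size buffer, flushes whenever the buffer fills, and keeps separate counters for flush count and time spent flushing, so the cost of writing the buffer can be looked at apart from the cost of recording events.

```python
import io
import time


class TraceBuffer:
    """Hypothetical event buffer that flushes to a sink when it fills up."""

    def __init__(self, sink, capacity):
        self.sink = sink            # file-like object standing in for the trace file
        self.capacity = capacity    # maximum number of buffered events (assumed unit)
        self.events = []
        self.flushes = 0            # number of non-empty flushes performed
        self.flush_seconds = 0.0    # wall-clock time spent writing the buffer

    def record(self, name):
        # Instrumentation cost: one timestamp and one append per event.
        self.events.append((time.perf_counter(), name))
        if len(self.events) >= self.capacity:
            self.flush()

    def flush(self):
        # Flush cost: write every buffered event, then empty the buffer.
        if not self.events:
            return
        start = time.perf_counter()
        for timestamp, name in self.events:
            self.sink.write(f"{timestamp:.9f} {name}\n")
        self.events.clear()
        self.flushes += 1
        self.flush_seconds += time.perf_counter() - start


def run(n_events, capacity):
    """Record n_events synthetic events and return the buffer and its sink."""
    sink = io.StringIO()
    buf = TraceBuffer(sink, capacity)
    for i in range(n_events):
        buf.record(f"event_{i}")
    buf.flush()  # write whatever is left at the end of the run
    return buf, sink
```

Calling `run` with the same event count and different `capacity` values changes how many flushes occur while the number of recorded events stays fixed, which is the kind of separation between instrumentation cost and buffer-writing cost that the abstract describes. Timings from an in-memory sink say nothing about a real parallel file system.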
Similar resources
Trace-based performance analysis for the petascale simulation code FLASH
Performance analysis of applications on modern high-end Petascale systems is increasingly challenging due to the rising complexity and quantity of the computing units. This paper presents a performance analysis study with the Vampir performance analysis tool suite that examines the application behavior as well as the fundamental system properties. The study is done on the ORNL’s Cray XT4 system...
End-to-end tracing considered essential
Concurrency and complexity are major obstacles to understanding application performance in high-performance systems. We advocate end-to-end event tracing as the correct way to expose performance information for both human and automated analysis. We describe its advantages over traditional performance counter data, and illustrate its uses in performance visualization, prediction for capacity pla...
An Investigation of Tracing Overheads on High End Systems
Although event tracing of parallel applications offers highly detailed performance information, tracing on current leading edge systems may lead to unacceptable perturbation of the target program and unmanageably large trace files. High end systems of the near future promise even greater scalability challenges. In this work we identify and quantify the overheads of application tracing. We repor...
Scalable Reactive Stream Processing Using DDS and Rx
Event-driven design is fundamental to developing resilient, responsive, and scalable reactive systems as it supports asynchrony and loose coupling. The OMG Data Distribution Service (DDS) is a proven event-driven technology for building data-centric reactive systems because it provides the primitives for decoupling system components with respect to time, space, quality-of-service, and behavior....
Scalable Reactive Stream Processing Using DDS and Rx: An Industry-Academia Collaborative Research Experience
Event-driven design is fundamental to developing resilient, responsive, and scalable reactive systems as it supports asynchrony and loose coupling. The OMG Data Distribution Service (DDS) is a proven event-driven technology for building data-centric reactive systems because it provides the primitives for decoupling system components with respect to time, space, quality-of-service, and behavior....